Weighted multi-view key-frame extraction

نویسندگان

  • Antonis Ioannidis
  • Vasileios Chasanis
  • Aristidis Likas
چکیده

The extraction of representative key-frames from video shots is very important in video processing and analysis, since it constitutes the basis for several important tasks such as video shot summarization, browsing and retrieval as well as high-level video segmentation. The extracted key-frames should capture a great percentage of the information of a shot content, while at the same time they should not present similar visual information. Clustering or segmentation methods are usually employed to extract key-frames. A major difficulty is caused by the large variety in the visual content of videos. Thus, using a single image descriptor (color, texture etc) to extract key-frames is not always effective, since there is no single descriptor surpassing the others in all video cases. To tackle this problem, we propose an approach for the weighted fusion of several descriptors that automatically estimates the weight of each descriptor. The weights reflect the relevance of each descriptor for the specific video shot. Moreover, they are used to form a composite similarity matrix as the weighted sum of all the similarity matrices corresponding to the individual descriptors. This matrix is then used as input to a spectral clustering algorithm that partitions shot frames into groups. Finally the medoid frame of each group is selected as key-frame. Numerical experiments using a variety of videos demonstrate that our method is capable of efficiently summarizing video shots regardless of the characteristics of the visual content of a video. © 2016 Elsevier B.V. All rights reserved.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

FudaSys Video Retrieval in TRECVID 2012

The video retrieval system we developed for TRECVID 2012 mainly involves the semantic indexing task which includes key frame extraction, low level feature extraction, classification and concept fusion. We extracted a new low level feature, explored various classification and fusion schemes. Four “light” runs and two 2 “pair” runs we submitted are as follows: L_A_FudaSys1: Fusion based on concep...

متن کامل

Dominant Color Based Extraction of Key Frames for Sports Video Summarization

This paper proposes a novel approach of dominant color based extraction of key frames for sports video summarization. The visual features have been used to obtain play field color shots and non-play field color shots. For the every play field color shots dominant colored key frame has been extracted using color histogram analysis and created video summary. This provides users a way to swiftly b...

متن کامل

The Key Frame Extraction Algorithm Based on the Indigenous Disturbance Variation Difference Video

In view of the traditional support vector machine (SVM) learning algorithm’s learning parameters is not easy determined in the process of video key frame extraction and the accuracy is low, an independent perturbation variable difference SVM algorithm is used for video key frame extraction. First of all, analyzed the biological mechanism of differential evolution algorithm, put forward an impro...

متن کامل

Pseudo Zernike Moment-based Multi-frame Super Resolution

The goal of multi-frame Super Resolution (SR) is to fuse multiple Low Resolution (LR) images to produce one High Resolution (HR) image. The major challenge of classic SR approaches is accurate motion estimation between the frames. To handle this challenge, fuzzy motion estimation method has been proposed that replaces value of each pixel using the weighted averaging all its neighboring pixels i...

متن کامل

Content-Based Retrieval of Video Surveillance Scenes

A novel method for content-based retrieval of surveillance video data is presented. The study starts from the realistic assumption that the automatic feature extraction is kept simple, i.e. only segmentation and low-cost filtering operations have been applied. The solution is based on a new and generic dissimilarity measure for discriminating video surveillance scenes. This weighted compound me...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:
  • Pattern Recognition Letters

دوره 72  شماره 

صفحات  -

تاریخ انتشار 2016